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The Calculus of Wrapped Compartments (C WC) is a variant of the Calculus of Looping Sequences 
(CLS). While keeping the same expressiveness, CWC strongly simplifies the development of auto- 
matic tools for the analysis of biological systems. The main simplification consists in the removal of 
the sequencing operator, thus lightening the formal treatment of the patterns to be matched in a term 
(whose complexity in CLS is strongly affected by the variables matching in the sequences). 

We define a stochastic semantics for this new calculus. As an application we model the interac- 
tion between macrophages and apoptotic neutrophils and a mechanism of gene regulation in E.Coli. 

1 Introduction 

In computer science, several formalisms have been proposed for the description of the behaviour of 
biological systems. Automata-based models ||2j |22l have the advantage of allowing the direct use of 
many verification tools such as model checkers. Rewrite systems |[T3l l25l IH usually allow describing 
biological systems with a notation that can be easily understood by biologists. Compositionality allows 
studying the behaviour of a system componentwise. Both automata-like models and rewrite systems 
present, in general, problems from the point of view of compositionality, which, instead, is in general 
ensured by process calculi, included those commonly used to describe biological systems |[26ll24l l9l. 

The Calculus of Looping Sequences [5] (CLS) has been introduced to describe microbiological sys- 
tems, such as cellular pathways, and their evolution. The terms of the calculus are constructed by basic 
constituent elements and operators of sequencing, looping, containment and parallel composition. The 
looping operator allows tying up the ends of a sequence, thus creating a circular sequence which can 
represent a membrane. The presence of sequences and loops, however, makes more difficult to develop 
a correct and efficient implementation of the associated simulation tools. The main difficulties come 
from the complexity of the pattern matching algorithms which should take sequences (and all their sub- 
sequences) into account. Moreover, the use of sequences and sequence variables seems prone to subtle 
semantic ambiguities in rules definition. 

In this paper we propose the Calculus of Wrapped Compartments (CWC for short), a variant of 
CLS suitable for efficient implementation. In CWC, starting from an alphabet of atomic elements, we 
build multisets of elements and compartments. We are able to localise elements by compartmentalisation 
and we allow to specify the structure of the compartment (e.g., detailing the elements of interest on a 
membrane). The evolution of the system is driven by a set of rewrite rules modelling the reactions of 
interest. The main difference with respect to CLS consists in removing the sequencing operator and 
using multisets (instead of ordered sequences) of atoms to represent membranes. Such a choice, while 
preserving the Turing completeness of the calculus, strongly simplifies the pattern matching algorithm 
and the underlying basic theory. 
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In our previous experience with biological models we realized that sequences were not used in many 
situations: cellular pathways are usually represented with a greater abstraction than the one which uses 
amino acids and nucleotides to depict proteins and genes - moreover representing the primary sequence 
of a protein without considering secondary and tertiary structure is seldom useful. CLS used looping 
sequences to represent membranes, but we believe that multisets of atomic elements are a more natural 
representation, which reflects the fluid mosaic model |28|. A similar approach is taken in the CLS+ 
model (see [6|), a variant of CLS in which the looping operator can be applied to a parallel composition 
of sequences. 

We provide CWC with a stochastic operational semantics from which a continuous time Markov 
chain can be build. With respect to [4J, in which a constant rate is associated to the rewrite rules, we 
allow a more flexible treatment of the dynamics. In particular, we associate a function to each rewrite 
rule according to which the rate of the transition can be computed depending on the initial and the target 
terms. We show how the law of mass action can be encoded in our stochastic semantics thus following 
the standard Gillespie's approach ifTTl . 

Summary. In Section [2] we introduce the new formalism and show its Turing completeness. In Sec- 
tion |3] we provide some guidelines for the modelling of common biological events. We give some detail 
on how the interaction between macrophages and apoptotic neutrophils, and a mechanism of gene reg- 
ulation can be described in the new calculus. In Section |4] we present a stochastic semantics of the 
calculus. In Section [5| we report some simulations regarding the gene regulation mechanism introduced 
in Section|3] Finally, in Section|6]we draw some conclusions. 



2 The Calculus of Wrapped Compartments 

In this section we present the nondeterministic Calculus of Wrapped Compartments (CWC). 
2.1 Terms and Structural Congruence 

A term of the CWC calculus is intended to represent a biological system. A term is a sequence of simple 
terms. Simple terms, ranged over by t, u, v, w, are built by means of the compartment constructor, 
(—J — ), from a set jz/ of atomic elements {atoms for short), ranged over by a, b, c, d. The syntax of 
simple terms is given at the top of Figure [T] We write t to denote a (possibly empty) sequence of simple 
terms t\---t„. Similarly, with a we denote a (possibly empty) sequence of atoms. The set of simple terms 
will be denoted by ^. The set of terms (sequences of simple terms) and the set of sequences of atoms 
will be denoted by 3^ and £/, respectively. Note that C 

A term t = t\ - --tn should be understood as the multiset containing the simple terms , . . . , f„. There- 
fore, we introduce a relation of structural congruence, following a standard approach in process algebra. 
The CWC structural congruence is the least equivalence relation on terms satisfying the rules given at 
the bottom of Figure [l] From now on we will always consider terms modulo structural congruencej^ 
Then a simple term is either an atom or a compartment (a J t) consisting of a wrap (represented by the 
multiset of atoms a) and a content (represented by the term t). We write the empty multiset as • and 
denote the union of two multisets m and v as m v. 

' In the implementation of tlie derived tools it will be useful to consider a normalized representation of these terms suitable 
for efficient manipulation. In the description of the calculus given here however we will ignore implementation issues. 
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Simple terms syntax 

t ::= a I {a\ t) 



Structural congruence 

t u w V = t w u V if a = b and t = u then (a J t) = {b\ u) 



Figure 1: CWC term syntax and structural congruence rules. 



a b c| a b c| 



Ci) (ii) (iii) 

Figure 2: (i) represents {a b cj •); (ii) represents (a b c\{d e\»)); (iii) represents 
{a b c\ {d e\ •) f g). 

An example of term is a b {c d \e f) representing a multiset consisting of two atoms a and b (for 
instance two molecules) and a compartment {c d \e f) which, in turn, consists of a wrap (a membrane) 
with two atoms c and d (for instance, two proteins) on its surface, and containing the atoms e (for 
instance, a molecule) and / (for instance a DNA strand). See Figure|2]for some graphical representations. 

2.2 Systems and Reduction Semantics 

Rewrite rules are defined as pairs of terms, in which the left term characterizes the portion of the system 
in which the event modelled by the rule can occur, and the right one describes how that portion of the 
system is changed by the event. 

In order to formally define the rewriting semantics, we introduce the notion of open term (a term 
containing variables) and pattern (an open term that may be used as left part of a rewrite rule). In 
order to respect the syntax of terms, we distinguish between "wrap variables" which may occur only in 
compartment wraps (and can be replaced only by multisets of atoms) and "term variables" which may 
only occur in compartment contents or at top level (and can be replaced by arbitrary terms). Therefore, 
we assume a set of term variables, i^, ranged over by X,Y,Z, and a set of wrap variables, ranged 
over by x,y,z- These two sets are disjoint. We denote by y the set of all variables Y-^Wf^, and with p 
any variable in Y. 

Definition 2.1 (Open Terms and Patterns) 

(i) Open terms are terms which may contain occurrences of wrap variables in compartment wraps 
and term variables in compartment contents or at top level. They can be seen as multisets of 
simple open terms. More formally, open terms, ranged over by O and simple open terms, ranged 



a b c| 
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over by o, are defined in the following way: 

O ::= o 

o ::= a | X | {dx\o) 
(ii) An open term is linear if each variable occurs at most once in it. 

(Hi) Simple patterns, ranged over by S, are the linear open terms defined in the following way: 

S ::= t I {ax\SX) 

where t is an element of ,5^, a is an element of s^, x is a variable in 'f-^, S is a possibly empty 
multiset of simple patterns and X is a variable in 'Y-^. We denote with 5^ the set of simple patterns. 

( iv) Patterns, ranged over by P, are the linear open terms defined in the following way: 

P ::= SSX 



where S S is a nonempty multiset of simple patterns and X is an element of Y-^. We denote with 
^ the set of patterns. 

An instantiation is a partial function o '.Y ^ ^. An instantiation must preserve the type of variables, 
thus for Z G Y-^ and x G Y-^ we have a(Z) G 5^ and a{x) G respectively. Given O G ^, with Oa we 
denote the term obtained by replacing each occurrence of each variable p eY appearing in O with the 
corresponding term C7(p). With £ we denote the set of all the possible instantiations, and, given O E ff, 
with Var{0) we denote the set of variables appearing in O. Now we can define rewrite rules. 

Definition 2.2 (Rewrite Rules) A rewrite rule is a pair of a pattern P and an open term O, denoted with 
P^O, such that Var{0) C Var{P). 

A rewrite rule P^O states that a term Pa, obtained by instantiating variables in P by some in- 
stantiation function o, can be transformed into the term Oc. A set of rewrite rules define a notion of 
reduction between terms of CWC which are intended to represent the (nondeterministic) behaviour of 
the represented system. 

Definition 2.3 A CWC system over a set of atoms is represented by a set for short when 

is understood) of rewrite rules over . 

We define the semantics of CWC as a transition system in which states correspond to terms and 
transitions correspond to rule apphcations. To this aim we need to define the notion of contexts (i.e. 
terms containing a hole). 

Definition 2.4 (Contexts) Contexts C are defined as: 

C ::= □ I (aJC) t 

where d^s^ and t^S^. The context □ is called the empty context. We denote with ^ the set of contexts. 
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By definition, every context contains a single hole □. Let us assume C,C' G 'rf. With C[i] we denote 
the term obtained by replacing □ with t in C; with C[C'] we denote context composition, whose result is 
the context obtained by replacing □ with C' in C. For example, given C = {a b\ □) i 1,C' = {c d \n) g h 
and t = e f, we get C[C'[?]] = {a b\ (c d\e f) g h) i I. Note that context holes take the place either of 
the whole term or of the whole content of a compartment. This allows to make context unambiguous in 
the following sense. Let's extend the notion of subset (denoted as usual as C) between terms interpreted 
as multisets. 

Proposition 2.5 (Uniqueness) For any term t if the term u occurs in t within a compartment content or 
at top level, then there are, modulo =, a unique context C and a unique term t' such that t = C[t'] and 

Given a CWC system ^ the associated notion of reduction is then defined in the following way. 

Definition 2.6 (Reduction) Given a CWC system , the ^-reduction relation between terms of CWC 
is defined by the following rule: 

P^Oe^ aeL t = Pa u = Oa Ce'tf 
C[t] ^^C[u] 

closed under structural congruence, i.e.: 

u = u' and u' i— )-,^ v' and V = v u^^v 



We will write simply i— )• instead of^^g: when ^ is understood. 
Remark 2.7 

{i) Taken a pattern P = S S X (representing the reactants of the reaction that will be simulated) the 
crucial point for determining an application of P to the whole biological ambient t is the choice 
of the occurrences of simple terms matching with S S ( determining the compartment in which 



this reaction will take place). By Proposition 2.5 and the linearity condition both the context C 



(such that t = C\t'\) and the term replacing X ( the part oft' that does not match S S) are uniquely 
determined. This will play a role in the implementation of the stochastic calculus. 

(ii) There can be, in general, many different substitution such that P(J = t but not all of them produce 
the same term Oo (modulo =). This is not particularly relevant for the case of qualitative systems 
but we will need to take it into account in the stochastic formulation. 

Notation 2.8 

(i) We write S S ^ O as short for S S X O X, where the variable X does not occur in Var{S S) U 
Var{0). 

(ii) We write a a ^ b as short for the rewrite rule (a a x\Y) Z i— )■ (b x\ Y) Z, where the variables 
X, Y,Z are distinct. 

The CWC is Turing Complete. In the following we sketch how Turing Machines can be simulated 
by CWC models. 



Theorem 2.9 (Turing Completeness) The class of CWC models is Turing complete. 
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Biomolecular Entity 


CWM term 


Elementary object (genes, domains, etc..) 


Atoms 


Molecular population 


Term multiset 


Membrane 


Atom multiset 



Table 1: Guidelines for the abstraction of biomolecular entities into CWC. 



Proof: (Sketch). A Turing machine T over an alphabet ZU {jl} (where fl represent the blank) with a set 
Q of states can be simulated by a system of CWC in the following way. 

Take £/ = QULU {'^,l,r}, where I and r are special symbols to represent the left and right ends of the 
tape. The tape of the Turing machine can be represented by a sequence of nested compartments whose 
wraps consist of a single atom (representing a symbol of the tape). The content of each compartment 
defined in this way represents a right suffix of the written portion of the tape, while the atom on the 
wrap represents its initial (w.r.t. the suffix) symbol. In each term representing a tape there is exactly 
one compartment which contains a state (the present state). For example, the term {I \ {a \ q {b \ {r \ •)))) 
represents the configuration in which the tape is ...jJ,a,Z7, jj.... (the remaining positions are blank), the 
machine is in state q and the head is positioned on b. Rewrite rules are then used to model the machine 
evolution. We can define rules creating new blanks when needed (at the tape ends) to mimic a possibly 
infinite tape. For instance a transition {q,b) — )■ {q' ,c, right) (meaning that in state q with a.b on the head 
the machine goes in state q' writing c on the tape and moving the head right) is represented by the rule^ 

(1) q{by\ix\Z)Y)X^{cy\q' {x\Z)Y)X 

(2) q {b y\ (rj Z)Y)X^ {cy\ q' («J (rj Z)) Y) X 

The second rule represents the case that b is the rightmost non blank symbol and so a new blank must be 
introduced in the simulation. [^By construction, the system correctly represents the behaviour of T . 

□ 



3 Modelling Guidelines 

In this section we will give some explanations and general hints about how CWC could be used to 
represent the behaviour of various biological systems. 

In rewrite systems, such as CWC, entities are usually represented by terms of the rewrite system, 
and events by rewrite rules. We have already introduced the biological interpretation of our terms in 



Section 2. 1 Here we will go deeper and describe the most straightforward abstraction that can be used 
to define terms and rules. 

First of all, we should select the biomolecular entities of interest. Since we want to describe cells, 
we consider molecular populations and membranes. Molecular populations are groups of molecules that 
are in the same compartment of the cells and inside them. As we have said before, molecules can be of 
many types: we classify them as proteins, chemical moieties and other molecules. 

Membranes are considered as elementary objects: we do not describe them at the level of the phos- 
pholipids they are made of. The only interesting properties of a membrane are that it may have a content 



^In these rules, in a Turing Machine simulation, the variables X and Y are always instantiated with the empty multiset 
^Note that rule (1) could be applied also in this case but it would lead the system in a configuration (containing a subterm 
(rJ t) for some t) from which no further move could be possible. However situations of this kind can be easily avoided with 
a little complication of the encoding. 
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Biomolecular Event 


Examples of CWC Rewrite Rules 


State change 


b 


Complexation 


a b ^ c 


Decomplexation 


c ^ a b 


Catalysis 


c S S ^ c where 5 5 =^ is the catalyzed event 


State change on membrane 


a ^ b 


Complexation 
on membrane 


a b ^ c 

a {bx\X) {cx\X) 
{b x\aX)^ {c x\X) 


Decomplexation 
on membrane 


c ^ a b 

{cx\X)^a {bx\X) 
{c x\X)^ {b x\aX) 


Catalysis on membrane 


c a a ^ c b where a a — )■ is the catalyzed event 


Membrane crossing 


a {x\X) =^ {x\ a X) 
{x\ aX)^a (xJ X) 


Catalyzed 
membrane crossing 


a {bx\X) {bx\aX) 
{bx\aX)^a {bx\X) 


Membrane joining 


a {x\X) =^ {a x\X) 
{x\ a X) ^ {a x\ X) 


Catalyzed 
membrane joining 


a {bx\X)^{abx\X) 
{bx\aX)^labx\X) 
(xJ a b X)^ {a x\b X) 


Membrane fusion 


{a xJ X) {b y\Y)^{axby\X Y) 


Vesicle dynamics 


{x\X {ay\Y))^Y {ayx\X) 



Table 2: Guidelines for the abstraction of biomolecular events into CWC. 



(hence, create a compartment) and that in its phospholipid bilayer various proteins are embedded, which 
act for example as transporters and receptors. Since membranes are represented as multisets of the 
embedded structures, we are modelling a fluid mosaic in which the membranes become similar to a 
two-dimensional liquid where molecules can diffuse more or less freely ESll . 

Now, we select the biomolecular events of interest. The simplest kind of event is the change of 
state of an elementary object. Then, there are interactions between molecules: in particular complex- 
ation, decomplexation and catalysis. Interactions could take place between simple molecules, depicted 
as single symbols, or between membranes and molecules: for example a molecule may cross or join a 
membrane. Finally, there are also interactions between membranes: in this case there may be many kinds 
of interactions (fusion, vesicle dynamics, etc. . . ). 

The guidelines for the abstraction of biomolecular entities and events into CWC are given in Table[T] 
and Table [2j respectively. Biomolecular events are associated with rewrite rules. In the table we give 
some examples of rewrite rules for different kinds of events. The list has not the ambition of being 
complete but to give some general hints on rules definition. 
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Figure 3: Neutrophils phagocytosis: a) a macrophage distinguishes between apoptotic and living neu- 
trophils, b) the death neutrophile is "eaten" and embedded in a phagosome, c) the phagosome fuses 
with a lysosome. 

3.1 Modelling Examples 

We will now present two examples of possible CWC simulations, in order to testify that our calculus is 
able to express a broad range of biological situations even if the possibility to represent sequences, which 
is a computational burden, is removed from its syntax. 

Macrophages. All the biological problems involving cell interactions are feasible. One could for ex- 
ample represent macrophages and their interaction with the apoptotic cells: in order to "clean up" the 
remains of death cells they have to recognize them amongst the alive ones and then remove them by a 
process called phagocytosis, which consists in the internalization of the death cell in a membrane com- 
partment, called phagosome. The phagosome inside the macrophage will then join another membrane 
compartment called the lysosome in order to start the degradation of the ingested cellular components. 

To give the rules that could represent this process in CWC we decided to concentrate on the re- 
moval of neutrophils that have undergone apoptosis: removing them before their membrane collapses is 
crucial because they contain some lytic enzymes that, if released in the organism, would lead to tissue 
damages ifTSl [T2l . The mechanism which mediates the recognition of apoptotic neutrophils is not per- 
fectly clear but we decided to reproduce the one described in ||29l [H which suggest that neutrophils and 
macrophages interact via CD31 and that somehow the CD31 of vial neutrophils is capable of "escaping" 
the phagocytosis. 

We will represent a macrophage with a lysosome and a CD31 exposed on its membrane in this 
way (where M is a marker to distinguish macrophages from other cells and lyso represents a lysosome 
containing some lytic enzymes)]^ 

(CD31 MJ {lyso\ lyticEnz) innerM) 

A living neutrophile (where N is a marker for neutrophils and V indicates that this is a viable neutrophile) 
is represented by: 

(CD31 V NJ innerL) 
An apoptotic one (where A indicates its apoptotic state) is represented by: 

(CD31 A NJ innerA) 

^Our calculus allows to represent with simple atoms some higher level information such as that represented by marker 
names. 
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Figure 4: The Pho regulation system. 



where innerM, innerL and inner A denote the internal material of a macrophage, a living neutrophile and 
an apoptotic one, respectively. 

Now a fairly simple set of rules is sufficient to represent the whole process. 

The binding of a neutrophile and a macrophage mediated by their CD3 1 molecules is represented by 
encapsulating the interacting cells within a new compartment wrapped with the marker I: 

(CD31 M x\ X) (CD31 N jj F) ^ (Ij (CD31 M x\ X) (CD31 N y\ Y)) 

After the contact the viable neutrophils are released: 

(I z\ (CD31 M x\ X) (CD31 N V 3;] F) Z) ^ (CD31 M x\ X) (CD31 N V 3;] F) 

While the apoptotic ones are phagocyted (phago represents the phagosome which contains the phago- 
cyted neutrophile): 

(IzJ(CD31 Mx\X) (CD31 N A yj F) Z) ^ (CD31 M x \ {phago \ {CD31 N Ay\Y))X) 

The phagosome can join the lysosome to start hydrolysis (phagolyso represents the phagolysosome 
derived from the fusion): 

{lyso x\ lyticEnz X) {phago z\ (CD31 N A F) Z) ^ {phagolyso x z\ (CD31 N A F) lyticEnz X Z) 

This should show that interactions involving cells of the immune system can be represented in CWC 
with little effort at different levels of detail: modelling some aspects of the complex network that drives 
immunological defence could be useful to study many human pathologies and involved processes, like 
in this example about inflammation and how it stops. 

Phosphate regulation. Switching to a completely different field we would like to describe a mecha- 
nism of gene regulation fairly well defined in E.Coli and other bacteria, which is based on a two com- 
ponent system: (i) the sensor, which can react to changes in the environment and interact with the other 
component; (it) the response regulator, which is capable of determining, when activated by the sensor. 



M. Coppo, F. Damiani, M. Drocco, E. Grassi & A. Troina 



91 



the reactions necessary to cope with the new environmental conditions. This schema is common to drive 
gene regulation in bacteria: the activated response regulator will act as a transcriptional activator. 

We will model the system related to environmental phosphate concentration, where the sensor is a 
transmembrane protein located on the inner membrane of the bacteria, called PhoR, and the response 
regulator is a cytoplasmatic protein which can be activated by phosphorylation and is called PhoB. Envi- 
ronmental phosphate will freely diffuse in the periplasmic space (between the two bacterial membranes) 
thanks to the outer membrane pores and there it will bind to PhoR. When bound by phosphate PhoR will 
be silent, while it shows a kinase activity on PhoB when it is unbound; therefore low levels of phosphate 
will leave many PhoR unbound and able to activate PhoB, while high levels will lead to a smaller frac- 
tion of transmembrane unbound PhoR and thus to little activation of PhoB. When phosphorylated, PhoB 
activates the transcription of many genes aimed at helping the bacteria survive in low phosphate condi- 
tions |[2TI . These mechanisms have been recently described as related to the pathogenic potential |[T6ll of 
some bacteria and therefore studying them could also have some importance in medicine. 

We will now present the set of rules needed to model this regulation system; in the following we 
will refer to the outer membrane pores as pore, to phosphate as Pi, to the phosphate bound and unbound 
PhoR as PhoR and PhoRP, to the inactive and active PhoB as PhoB and PhoBP, to its target genes as 
PhoGenes and to the proteins that they encode as PhoProt. 

We may represent an E.Coli with the following CWC term: 

{pore\ (PhoR PhoRP \ PhoB PhoBP PhoGenes)) 

where, here, a sequence PhoR represents a multiset of PhoR atoms (similar for the other atoms). Two 
rules describe the phosphate diffusion in the periplasmic space: 

Pi {pore x\ X) =^ {pore x\ Pi X) 
{pore x\ Pi X) =^ Pi {pore x\ X) 

Two rules for the PhoR phosphate binding: 

Pi {PhoR x\X)^ {PhoRP x\X) 
{PhoRP x\ X) ^ Pi {PhoR x\ X) 

PhoB phosphorylation: 

{PhoR x\ PhoB X) {PhoR x\ PhoBP X) 

High level rules that represent the translation and traduction to proteins of the PhoB target genes and 
these proteins degradation: 

PhoBP PhoGenes PhoBP PhoGenes PhoProt 
PhoProt =^> • 

One last rule represents the bound form of PhoR phosphatase activity on PhoBP lITOll . which is a negative 
regulating loop of the whole mechanism: 

{PhoRP x\ PhoBP X) ^ {PhoRP x\ PhoB X) 

In Section|5]we will give some details about simulations of this model. 
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4 The Stochastic Calculus of Wrapped Compartments 

In order to make the formal framework suitable to model quantitative aspects of biological systems we 
must associate to each rule a numerical parameter (the rule rate) which determines (in a stochastic sense) 
the time spent between two successive interactions and the frequency with which each interaction will 
take place in order to represent faithfully the system evolution. The operational semantics will be defined 
by incorporating the rule rates in a stochastic framework along the lines of the one presented by Gillespie 
in ifTTl . The same approach has been incorporated, for example, in other simulators as the ones for the 
stochastic TT-calculus Il23ll24l . 

In this framework the rule rate is used as the parameter of an exponential distribution modelling 
the time spent between two occurrences of the considered chemical reaction. The use of exponential 
distributions to represent the (stochastic) time spent between two occurrences of chemical reactions 
allows describing the system as a Continuous Time Markov Chain (CTMC), and consequently allows 
verifying properties of the described system analytically and by means of stochastic model checkers. 

The basic choice for determining the rule rate of a reaction (suggested also by Gillespie's algorithm) 
is to obtain it by multiplying the kinetic constant of the reaction by the number of possible combinations 
of reactants that may occur in the system, representing in this way the law of mass action. 

In our calculus we will represent a reaction rate as a function of the context in which the reaction takes 
place. This allows to tailor the reaction rates on the specific characteristics of the system, as for instance 
when representing nonlinear reactions as it happens for Michaelis-Menten kinetics. The standard mass 
action rate can be describes as a particular choice of the rate function (see below). A similar approach is 
used in llT4i to model reactions with inhibitors and catalysers in a single rule. 

Obviously some care must be taken in the choice of the rate function: for instance it must be complete 
(defined on the domain of the application) and nonnegative. This properties are enjoyed by the function 
representing the law of mass action. 

Definition 4.1 A stochastic rewrite rule is a triple {P,R,f), denoted P i — > R, where {P,R) is a rewrite 
rule and / : J?' x ^ — )• M-" is the rate function associated to the rule^ 

We can then define, as in Definition |2.6t the "stochastic" reduction relation for CWC. In this case. 



we must also take into account that, by Remark 2.7 different instantiations that allow the l.h.s. P of a 



rule to match a term t can produce different outcomes which could determine different rates. This is 



shown in Example 4.2 So, in the definition of the rate function, we must also take into account the term 



produced by the reaction after the application of the rule. 

Example 4.2 Consider the rewrite rule a {b x\X) i— ^ {a b x\X) modelling a catalyzed membrane 
joining. In this case, the initial term a {b b\c) {b\c) results in two dijferent terms depending on which 
membrane will be joined by the element a; namely {a b b\ c) {b\ c) and (b b\ c) {a b\ c). The applica- 
tion of the rule, moreover, should take into account that the first case is catalyzed by the presence of two 
elements b on the membrane, while the second one has only one catalyzer 



A Stochastic CWC systems M,s^ (M for short) can be defined as in Definition 2.6 from a set ^ of 
stochastic reduction rules over . 



^The value in the codomain of / models the situations in which the given rule cannot be applied, for example when the 
particular environment conditions forbid the application of the rule. 
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Definition 4.3 (Stochastic Reduction) Given a stochastic CWC system the induced stochastic re- 
duction relation between terms of CWC is defined by the following rule: 

pJ-^oe^ oez t = Po u = 0o ce"^ 



C^]^^^C[u] 

f f 

We will write simply i — > instead of\ — when ^ is understood. 

Chemical-like Reactions. To define the "canonical" function that represents the law of mass action the 
application rate of a reduction rule should depend on the kinetic constant of the reaction and the number 
of possible reactants in the considered compartment in which the reaction will take place. Following an 
approach similar to that of miHl, this can be obtained counting the number of different ways in which 
the rule representing a reaction can be applied in the considered context producing the same outcome. In 
our setting, however, we can formalize the canonical function in a simpler way. 

To this aim we need a few definitions. Assume to have an infinite set of labels (e.g. the natural 
numbers) that can be attached to the atomic elements in =2/ to create, for each a G an infinite set of 
distinguished versions a\,a2,ai,, ... of the element. If a,- is a labelled atom, we write a = for the 
support of a,-. 

Definition 4.4 

(/) A labelled term of CWC is a term in which some labelled atomic element can occur. 

( ii) A complete labelled term is a term in which all atomic elements are labelled and no two labelled 
atoms having the same support have the same label. 

(Hi) If t is a labelled term, its erasure \t\ is the term obtained from t by erasing all the labels. 

Labels can be applied also to atoms occurring in open terms, but no label can be associated to a 
variable. 

If f is a labelled (possibly open) term a labeling t+ of t is obtained by assigning labels to the non 
labelled atoms of t such that f+ is a completely labelled termj^ For example, taking t = a\ a aj, b a 
possible labeling of f is f+ = a\ a2 ^3 bi. 

f - - 

Given a reduction rule P 1 — O and terms t,u such that for some instantiation a it holds Po = t and 
Og = u, we define the canonical rate function f{t,u) by the following steps: 

1. take any completely labeled version f+ of 7; 

2. let n be the number of distinct instantiations of variables with completely labeled terms such that 
P'o = t+, for some labehng P' of P, and \Ro\ = u^ 

3. define the function / as f{t, u) =k-n where k is the kinetic constant of the associated reaction. 

We denote with /™" the function obtained in this way to compute the evolution rate following the 
law of mass action with kinetic constant k. 



Note that since the labels of the atomic elements are all different, by Proposition 2.5 and Remark 2.7 



(i) each instantiation a such that P'o = f+ corresponds to a distinct combination of the elements that 



^Note that there are many possible labellings of t. 

^Note that the labehng of the atoms in P is taken only in order to allow matching. It is the choice of the variables the relevant 



factor 
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determine the reaction described by the rule P i — > O. So the number n of substitutions defined in point 
2. corresponds exactly to the number of possible distinct reactants for the considered reaction. 

This number can often be calculated in a rather simple way. For instance in a rule X a b i — > X c 
each instantiation of X that matches any term containing at least one a and one b corresponds exactly 
to a pair of occurrences of a and b (the ones not included in the variable's instantiation). So the total 
number of distinct instantiations of X (i.e. of possible reactants) is given by the product of the number 
of occurrences of a times the number of occurrences of b. 

Example 4.5 Consider the completely labeled term f+ = ai ai b\ defined above and the qualitative 
reduction rule a a X ^ a c X. The l.h.s. of the rule can be matched by the instantiations X = a\ b\, 
X = a2,bi and X = a3 b\. In the first case, the match is obtained by taking P' = a2 a^ X and so on. So, 
we have that f^"" (a a a b, a a c b) = 3-k where k is the kinetic constant of the reaction. 

Example 4.6 Conside r aga in the term t = a (b b\c) {b\c) and the rewrite rule a {b x\X) i— ^ (a b x\X) 

2-k 

If the function f is defined as the canonical function f^"", we get t i — > 



4.2 



described in Example 
{a b b\ c) {b\ c) and 1 1-^-^- {b b\ c) {a b\ c). 

Remark 4.7 Note that the canonical function f-"" has the same meaning of the function G defined in 

m. 



5 An application 

We used a prot otype simulator for CWC to run some simulations for the Pho regulation system described 



1^ 



k 



in Section |3.l| In figurep i we use =^> to indicate that we use the canonical function with kinetic constant 
k. The rates here are chosen manually following these considerations: diffusion of phosphate across the 
outer membrane pores has the same speed in both directions, PhoRP phosphatase activity is slow with 
respect to its kinase activity in the unbound form, both these mechanism are slower than the simpler 
binding and unbinding of phosphate to PhoR and the rules that describe translation and traduction and 
protein degradation should be rather slow. 

Pi {pore x\X)^ {pore x\Pi X) 

{pore x\Pi X)^ Pi {pore x\X) 

Pi {PhoR x\ X) "4^ {PhoRP x\ X) 

{PhoRP x\ X) Pi {PhoR x\ X) 

{PhoR x\ PhoB X) {PhoR x\ PhoBP X) 

PhoBP PhoGenes ^'^^ PhoBP PhoGenes PhoProt 

r,, „ 0.00008 

PhoProt • 

{PhoRP x\ PhoBP X) °-4" {PhoRP x\ PhoB X) 



Figure 5: Stochastic rules for the phosphate regulation mechanism. 
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Pho-B target (Low extracellular PI) 

PhoB target (High extracellular PI) 
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Figure 6: Levels of PhoB target proteins over time with different extracellular phosphate levels. 



Perlplasmic PI PhioB target 
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Figure 7: Levels of PhoB target proteins and periplasmic phosphate in the low extracellular phosphate 
conditions. 



We performed two simulations: both of them started with 10 PhoB, 5 PhoR, 5 PhoRP and a PhoGenes 
element in a single bacterial cell. To reflect high and low extracellular phosphate we started with 20 and 5 
extracellular Pi, respectively. In Figure[6]we show the comparison of the PhoProt levels obtained in these 
simulations: as expected in the low phosphate condition the average level of the PhoB target proteins is 
higher. Figure [7] shows the relation between PhoProt levels and periplasmic phosphate for the low phos- 
phate simulation: oscillations in the protein levels are in sync with the levels of periplasmic phosphate: 
when the latter increases the former decreases and viceversa. Figure [8] shows the beginning of the same 
simulation but focuses on the relationships between PhoRP, PhoPB and periplasmic phosphate: PhoRP 
levels follows with a little delay the phophate ones, while PhoBP shows a clearly specular behaviour 
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Figure 8: Levels of phosphorilated PhoB and phosphate bound PhoR in the low extracellular phosphate 
conditions. 



with respect to PhoRP, as expected. 

In the future we will like to compare the results of these simulations with available experimental 
data 11. 



6 Conclusions 

As we have seen, C WC allows to model cellular interaction, localisation and membrane structures. Other 
formalisms were developed to describe membrane systems. Among them we cite Brane Calculi lU and 
P-Sy stems Ii25il . 

CWC can describe situations that cannot be easily captured by the above mentioned formalisms, 
which consider membranes as atomic objects. Representing the membrane structure as a multiset of 
the elements of interest allows the definition of different functionalities depending on the type and the 
number of elements on the membrane itself. 

From a qualitative point of view, our calculus is essentially a variant of CLS f5\, while, from a 
quantitative point of view, the possibility to model the speed of reactions using functions instead of 
constant rates makes the CWC stochastic semantics more general with respect to the one of SCLS ||4l. 
In this sense, CWC stochastic semantics is more similar to the one in [14|. 

The current simulator for CWC is implemented in OCaml (see [ 1 ]). Since we do not need to analyse 
sequence structures and variables, the pattern matching algorithm has been simplified with respect to the 
simulator for SCLS described in [27 J. 

We are also considering to enrich our analysis framework by taking into account statistical model 
checking as done in [7] for stochastic CLS models, and with techniques to approximate the value of 
unknown kinetics in the stile of |[T9ll20l[TT]| . 



M. Coppo, F. Damiani, M. Drocco, E. Grassi & A. Troina 



97 



References 

[1] M. Aldinucci, M. Coppo, F. Damiani, M. Drocco, E. Giovannetti, E. Grassi, and A. Troina. SCWC-bio- 
simulator. Dipartimento di Informatica, Universita di Torino, 2010. http : //scwc-sim . sourcef orgeT] 
Inet/, 

[2] R. Alur, C. Belta, and F. Ivancic. Hybrid modeling and simulation of biomolecular networks. In HSCC, 
volume 2034 of LNCS, pages 19-32. Springer, 2001. 

[3] JH. Back, YJ. Kang, and SY. Lee. Transcript and protein level analyses of the interactions among phob, phor, 
phou and crec in response to phosphate starvation in escherichia coli. FEMS Microbiol Lett, 277:254-259, 
2007. 

[4] R. Barbuti, A. Maggiolo-Schettini, P. Milazzo, P. Tiberi, and A. Troina. Stochastic calculus of looping 
sequences for the modelling and simulation of cellular pathways. Transactions on Computational Systems 
Biology, IX-M-\U, loo's,. 

[5] R. Barbuti, A. Maggiolo-Schettini, P. Milazzo, and A. Troina. A calculus of looping sequences for modelling 
microbiological systems. Fundam. Inform., 72(l-3):21-35, 2006. 

[6] Roberto Barbuti, Andrea Maggiolo-Schettini, Paolo Milazzo, and Angelo Troina. The calculus of looping 
sequences for modeling biological membranes. In Workshop on Membrane Computing, volume 4860 of 
Lecture Notes in Computer Science, pages 54-76. Springer, 2007. 

[7] T. A. Basuki, A. Cerone, and P. Milazzo. Translating stochastic els into maude. Electr Notes Theor Comput. 
227:37-58, 2009. 

[8] S. Brown, 1. Heinisch, E. Ross, K. Shaw, CD. Buckley, and J. Savill. Apoptosis disables CD31 -mediated cell 
detachment from phagocytes promoting binding and engulfment. Nature, 418:200-203, 2002. 

[9] L. Cardelli. Brane calculi. In CMSB, volume 3082 of LNCS, pages 257-278. Springer, 2005. 

[10] DO. Carmany, K. Hollingsworth, and WR. McCleary. Genetic and biochemical studies of phosphatase activ- 
ity of phor JBacterioL, 185:1112-1115,2003. 

[11] A. Coletta, R. Gori, and F. Levi. Approximating probabilistic behaviors of biological systems using abstract 
interpretation. Electr Notes Theor Comput. ScL, 229(1): 165-182, 2009. 

[12] G. Cox, J. Crossley, and Z. Xing. Macrophage engulfment of apoptotic neutrophils contributes to the resolu- 
tion of acute pulmonary inflammation in vivo. Am J Respir Cell Mol Biol., 12:232-237, 1995. 

[13] V. Danos and C. Laneve. Formal molecular biology. Theor Comput. Sci., 325(1):69-1 10, 2004. 

[14] M. Dezani-Ciancaglini, P. Giannini, and A. Troina. A type system for a stochastic els. InMeCBIC, volume 1 1 
of EPTCS, pages 91-106, 2009. 

[15] B. Fadeel and VE. Kagan. Apoptosis and macrophage clearance of neutrophils: regulation by reactive oxygen 
species. Redox Rep., 8:143-150, 2003. 

[16] GM. Ferreira and B. Spira. The pst operon of enteropathogenic escherichia coli enhances bacterial adherence 
to epithehal cefls. Microbiology, 154:2025-2036, 2008. 

[17] D. Gillespie. Exact stochastic simulation of coupled chemical reactions. /. Phys. Chem., 81:2340-2361, 
1977. 

[18] J. Krivine, R. Milner, and A. Troina. Stochastic bigraphs. Electron. Notes Theor Comput. Sci., 218:73-96, 
2008. 

[19] R. Lanotte, A. Maggiolo-Schettini, and A. Troina. Decidability results for parametric probabilistic transition 
systems with an application to security. In SEFM'04, pages 114-121. IEEE Computer Society, 2004. 

[20] R. Lanotte, A. Maggiolo-Schettini, and A. Troina. Parametric probabilistic transition systems for system 
design and analysis. Formal Asp. Comput., 19(1):93-109, 2007. 

[21] H. Lodish, A. Berk, L.S. Zipursky, P. Matsudaira, D. Baltimore, and J. Darnell. Many Bacterial Responses 
Are Controlled by Two-Component Regulatory Systems Molecular Cell Biology. W.H. Freeman, 2000. 



98 



Stochastic Calculus of Wrapped Compartments 



[22] H. Matsuno, A. Doi, M. Nagasaki, and S. Miyano. Hybrid petri net representation of gene regulatory network. 
In Prooceedings of Pacific Symposium on Biocomputing, pages 341-352. World Scientific Press, 2000. 

[23] C. Priami. Stochastic pi-calculus. Comput. J., 38(7):578-589, 1995. 

[24] C. Priami, A. Regev, E. Y. Shapiro, and W. Silverman. Application of a stochastic name-passing calculus to 
representation and simulation of molecular processes. Inf. Process. Lett., 80(1):25-31, 2001. 

[25] G. Paun. Membrane computing. An introduction. Springer, 2002. 

[26] A. Regev and E. Shapiro. Cells as computation. Nature, 419:343, 2002. 

[27] G. Scatena. Development of a stochastic simulator for biological systems based on the calculus of looping 
sequences. Master's thesis. University of Pisa, 2007. 

[28] SJ. Singer and GL. Nicolson. The fluid mosaic model of the structure of ceU membranes. Science, 175:720- 
731, 1972. 

[29] EE Vernon-Wilson, E. Aurad, L. Tian, IC. Rowe, MJ. Shipston, J. Savill, and SB. Brown. CD31 delays 
phagocyte membrane repolarization to promote efficient binding of apoptotic ceUs. JLeukoc Biol., 82: 1278- 
1288, 1995. 



